Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Multimodal voice interaction
# Multimodal voice interaction
Ultravox V0 4
MIT
Ultravox is a multimodal voice large language model based on Llama3.1-8B-Instruct and Whisper-medium, capable of processing both voice and text inputs simultaneously.
Audio-to-Text
Transformers
Supports Multiple Languages
U
fixie-ai
1,851
48
Featured Recommended AI Models
Empowering the Future, Your AI Solution Knowledge Base
English
简体中文
繁體中文
にほんご
© 2025
AIbase